AITopics | analyzing hidden representation

Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems

Neural Information Processing Systems, Nov-21-2025, 15:57:38 GMT

Neural networks have become ubiquitous in automatic speech recognition systems. While neural networks are typically used as acoustic models in more complex systems, recent studies have explored end-to-end speech recognition systems based on neural networks, which can be trained to directly predict text from input acoustic features. Although such systems are conceptually elegant and simpler than traditional systems, it is less obvious how to interpret the trained models. In this work, we analyze the speech representations learned by a deep end-to-end model that is based on convolutional and recurrent layers, and trained with a connectionist temporal classification (CTC) loss. We use a pre-trained model to generate frame-level features which are given to a classifier that is trained on frame classification into phones. We evaluate representations from different layers of the deep model and compare their quality for predicting phone labels. Our experiments shed light on important aspects of the end-to-end model such as layer depth, model complexity, and other design choices.
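The probing setup described in the abstract (frame-level features from a frozen model, fed to a separate classifier that predicts phone labels, repeated per layer) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the features are synthetic rather than extracted from a real speech model, the layer names `layer_a` and `layer_b` are hypothetical, and a plain linear softmax classifier stands in for whatever classifier the authors used.

```python
import numpy as np

def train_softmax_probe(X, y, num_classes, lr=0.05, epochs=300):
    """Fit a linear softmax classifier on frozen features X (frames x dims)."""
    n, d = X.shape
    W = np.zeros((d, num_classes))
    b = np.zeros(num_classes)
    Y = np.eye(num_classes)[y]                       # one-hot phone labels
    for _ in range(epochs):
        logits = X @ W + b
        logits -= logits.max(axis=1, keepdims=True)  # numerical stability
        P = np.exp(logits)
        P /= P.sum(axis=1, keepdims=True)
        G = (P - Y) / n                              # gradient of mean cross-entropy w.r.t. logits
        W -= lr * (X.T @ G)
        b -= lr * G.sum(axis=0)
    return W, b

def probe_accuracy(W, b, X, y):
    """Frame classification accuracy of the probe."""
    return float(np.mean(np.argmax(X @ W + b, axis=1) == y))

def layer_features(centers, labels, noise, rng):
    """Synthetic stand-in for features taken from one layer of a frozen model."""
    return centers[labels] + noise * rng.normal(size=(len(labels), centers.shape[1]))

rng = np.random.default_rng(0)
num_phones, dim = 5, 16                              # illustrative sizes, not from the paper
y_train = rng.integers(0, num_phones, size=1000)
y_test = rng.integers(0, num_phones, size=500)

results = {}
# Hypothetical layers: lower noise means phone identity is easier to read out.
for name, noise in [("layer_a", 0.5), ("layer_b", 3.0)]:
    centers = rng.normal(size=(num_phones, dim))
    X_train = layer_features(centers, y_train, noise, rng)
    X_test = layer_features(centers, y_test, noise, rng)
    W, b = train_softmax_probe(X_train, y_train, num_phones)
    results[name] = probe_accuracy(W, b, X_test, y_test)
    print(name, round(results[name], 3))
```

In a real experiment, `layer_features` would be replaced by activations extracted from each layer of the pre-trained model, and the per-layer accuracies would be compared as a measure of how much phonetic information each layer encodes.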

analyzing hidden representation, end-to-end automatic speech recognition system, name change, (3 more...)

Genre: Research Report (0.98)

Technology: Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)

Reviews: Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems

Neural Information Processing Systems, Oct-8-2024, 06:42:30 GMT

The authors analyze CTC-trained acoustic models to determine how information related to phonetic categories is preserved in CTC-based models that directly output graphemes. The work follows a long line of research that has analyzed neural network representations to determine how they model phonemic representations, although to the best of my knowledge this has not been done previously for CTC-based end-to-end architectures. The results and analysis presented by the authors are interesting, although I have some concerns about the conclusions the authors draw, which I would like to clarify. Please see my detailed comments below. In the paper, the authors conclude that (Lines 159-164) "... after the 5th recurrent layer accuracy goes down again. One possible explanation to this may be that higher layers in the model are more sensitive to long distance information that is needed for the speech recognition task, whereas the local information which is needed for classifying phones is better captured in lower layers."

analyzing hidden representation, end-to-end automatic speech recognition system, information, (7 more...)

Country: North America > United States > Arizona > Maricopa County > Scottsdale (0.05)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.31)

Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems

Belinkov, Yonatan, Glass, James

Neural Information Processing Systems, Feb-14-2020, 10:25:52 GMT

analyzing hidden representation, classification, end-to-end automatic speech recognition system, (1 more...)

Genre: Research Report (0.86)

Technology: Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)